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Abstract 

In this work, we construct the first locally-correctable codes (LCCs), and locally-testable 
codes (LTCs) with constant rate, constant relative distance, and sub-polynomial query com¬ 
plexity. Specifically, we show that there exist binary LCCs and LTCs with block length n, 
constant rate (which can even be taken arbitrarily close to 1), constant relative distance, and 
query complexity exp(0(-\/log n)). Previously such codes were known to exist only with 
query complexity (for constant /3 > 0), and there were several, quite different, constructions 
known. 

Our codes are based on a general distance-amplification method of Alon and Luby [AL96] . 
We show that this method interacts well with local correctors and testers, and obtain our main 
results by applying it to suitably constructed LCCs and LTCs in the non-standard regime of 
sub-constant relative distanee. 

Along the way, we also construct LCCs and LTCs over large alphabets, with the same query 
complexity exp(0(Vlogn)), which additionally have the property of approaching the Singleton 
bound: they have almost the best-possible relationship between their rate and distance. This 
has the surprising consequence that asking for a large alphabet error-correcting code to further 
be an LCC or LTC with exp(0(-\/logn)) query complexity does not require any sacrifice in terms 
of rate and distance! Such a result was previously not known for any o{n) query complexity. 

Our results on LCCs also immediately give locally-decodable codes (LDCs) with the same 
parameters. 
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1 Introduction 


Locally-correctable codes [BFLSQlj ISTVOll IKTOO] and locally-testable codes [FS95t IRS96t IGSOb] 
are codes that admit local algorithms for decoding and testing respectively. More specifically: 

• We say that a code C is a locally-correctable code (LCcfl if there is a randomized algorithm 
that, when given a string z that is close to a codeword c G C, and a coordinate i, computes Cj 
while making only a small number of queries to z. 

• We say that a code C is a locally-testable code (LTC) if there is a randomized algorithm that, 
when given a string z, decides whether 2 ; is a codeword of C, or far from C, while making 
only a small number of queries to z. 


The number of queries that are used by the latter algorithms is called the query complexity. 

Besides being interesting in their own right, LCCs and LTCs have also played important roles 
in different areas of complexity theory, such as hardness amplification and derandomization (see 
e.g. |STVnij ). and probabilistically checkable proofs [AS98t[ALM'*~98j . It is therefore a natural and 
well-known question to determine what are the best parameters that LCCs and LTCs can achieve. 

LCCs and LTCs were originally studied in the setting where the query complexity was either 
constant or poly-logarithmic. In those settings, it is believed that LCCs and LTCs must be very 
redundant, since every bit of the codeword must contain, in some sense, information about every 
other bit of the codeword. Hence, we do not expect such codes to achieve a high rate. In particular, 


in the setting of constant que^ complexity, it is known that linear LCCs cannot have constant 
rate [KTOOi IWdWOSl IWooOTji. and that LTCs with certain restrictions cannot have constant 


rate [DKlli rBSV12] . On the other hand, the best-known constant-query LCCs have exponential 
lengtlo, and the best-known constant-query LTCs have quasi-linear length (see e.g. [BS081 IDinOTl 
IVid15] i. 


It turns out that the picture is completely different when allowing the query complexity to 
be much larger. In this setting, it has long been known that one can have LCCs and LTCs with 
constant rate and query complexity O(n^) for constant /? > 0 [BFLS9l| IR,S96| . More recently, 
it has been discovered that both LCCs |KSY14l ICKS131 IHOWlSj and LTCs [Vidlll ICKS13] can 
simultaneously achieve rates that are arbitrarily close to 1 and query complexity O(n^) for an 
arbitrary constant /3 > 0. This is in contrast with the general belief that local correctability and 
testability require much redundancy. 

In this work, we show that there are LCCs and LTCs with constant rate (which can in fact be 
taken to be arbitrarily close to 1) and constant relative distance, whose associated local algorithms 
have query complexity and running time. We hnd it quite surprising in light of the fact that 
there were several quite different constructions of LCCs and LTCs [BFLS9H IR,S96( IKSY141 IVidlH 
ICKS131IHOW13] with constant rate and constant relative distance, all of which had n(n^) query 
complexity. 

Furthermore, we show that over large alphabets, such codes can approach the Singleton bound: 
they achieve a tradeoff between rate and distance which is essentially as good as possible for general 


^There is a closely related notion of locally decodable codes (LDCs) that is more popular and very well studied. 
All our results for LCCs hold for LDCs as well, see discussion at the end of the introduction. 

^ [KTOOI IWdWO,^ IWooO^ proved a lower bound for the related notion of LDCs. Since every linear LCC is also 
an LDC, their lower bound applies to linear LCCs as well. 

®For example, a constant-degree Reed-Muller code is such an LCC. 
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error-correcting codes. Such a result was previously not known for any o(n) query complexity. 
This means that, remarkably, local correctability and local testability with queries over large 
alphabets is not only possible with constant rate and constant relative distance, but it also does 
not require “paying” anything in terms of rate and relative distance. 

We first state our theorems for the binary alphabet. 

Theorem 1.1 (Binary LCCs with sub-polynomial query complexity). For every r E (0,1), there 
exist (5 > 0 and an explicit infinite family of binary linear codes {Cn}^ satisfying: 

1. Cn has block length n, rate at least r, and relative distance at least 5, 

2. Cn is locally correctable from ^-fraction of errors with query complexity and running time at 
most exp(-^log n ■ log log n). 

Theorem 1.2 (Binary LTCs with sub-polynomial query complexity). For every r E (0,1), there 
exist 6 > 0 and an explicit infinite family of binary linear codes {C„}^ satisfying: 

1. Cn has block length n, rate at least r, and relative distance at least 6, 

2. Cn is locally testable with query complexity and running time at most exp(-^log n ■ log log n). 

The binary LCCs and LTCs in the above theorems are obtained by first constructing LCCs and 
LTCs over large alphabets, and then concatenating them with binary codes. The following theorems 
describe these large alphabet LCCs and LTCs, which in addition to having sub-polynomial query 
complexity, also approach the Singleton bound. 

Theorem 1.3 (LCCs with sub-polynomial query complexity approaching the Singleton bound). 
For every r E (0,1), there exists an explicit infinite family of linear codes {Cn}n satisfying: 

1. Cn has block length n, rate at least r, and relative distance at least 1 — r — o(l), 

2. Cn is locally correctable from -fraction of errors with query complexity and running 

time at most exp(-^logn • log log n), 

3. The alphabet of Cn is of size at most exp(exp(-y/log n ■ log log n)). 

Theorem 1.4 (LTCs with sub-polynomial query complexity approaching the Singleton bound). 
For every r E (0,1), there exists an explicit infinite family of linear codes {Cn}n satisfying: 

1. Cn has block length n, rate at least r, and relative distance at least 1 — r — o(l), 

2. Cn is locally testable with query complexity and running time at most exp(-v/log n • log log n), 

3. The alphabet of Cn is of size at most exp(exp(-\/log n ■ log log n)). 

The above theorems are proved in Sections [3] and [H 

Remark 1.5. If we were only interested in LCCs and LTCs with O(n^) query complexity (for 
arbitrary fi), we could have constructed binary codes that meet the Zyablov bound, which is the 
best-known rate-distance tradeoff for explicit binary codes. Furthermore, we could have constructed 
codes over constant-size alphabet that approach the Singleton bound (rather than having alphabet 
of super-constant size). 

Moreover, our results imply the existence of non-explicit binary LCCs/LTCs with query com¬ 
plexity exp(-^log n ■ log log n) that meet the Zyablov bound. This follows by concatenating the 
codes of Theorems o and oi with (non-explicit) Gilbert-Varshamov codes [Gil52l IVar57j . 


3 













The Alon-Luby distance-amplification. Our constructions are based on the distance-amplification 
technique of [AL96| . This distance amplifier, based on a d-regnlar expander, converts an error- 
correcting code with relative distance S> l/d into an error-correcting code with larger relative 
distance 5, while reducing the rate only by a factor of ~ (1 — <5). Thus for a large enough constant 
d, if we start with a code of rate 1 — e and relative distance ^ 1 /d, where e <C 5, then after distance 
amplification with a d-regular expander, we get a code with rate (1 — J) (1 — e) (1 — ^) and relative 
distance 5. 

The original application of this techniqne in [AL96] was to construct linear-time erasure- 
decodable codes approaching the Singleton bound. In addition to the above distance-amplification 
technique, |AL96| constructed a linear-time erasnre-decodable code (not approaching the Singleton 
bound) which could be used as the input code to the amplifier. The main result of [AL96] then fol¬ 
lows from the fact that distance amplification via a constant-degree expander preserves linear-time 
erasure-decodability. 

Subsequent applications of this distance-amplification technique followed a similar outline. One 
first constructs codes with high rate with some (possibly very small) constant relative distance and 
a certain desirable property. Then, applying distance amplification with a (possibly very large) 
constant-degree expander, one obtains a code with a much better tradeoff between its rate and 
relative distance. Finally one shows that the distance amplification with a constant degree expander 
preserves the desirable property. This scheme was implemented in [0105) . who constructed codes 
that can be decoded in linear time from errors (rather than erasures), and in [GI021IGR08] . who 
constructed capacity-achieving list-decodable codes with constant alphabet. 

Our observations. The first main observation of this paper is that the distance-amplification 
technique also preserves the property of being an LGG or an LTG. Specifically, if we start with 
an LGG or LTG with query complexity q, and then apply distance amplification with a d-regular 
expander, then the resulting code is an LGG/LTG with query complexity q ■ poly(d). 

The next main observation is that this connection continues to hold even if we take d to be 
super-constant, and take the LGG or LTG to have sub-constant relative distance Q{l/d) (and then 
we only require the LGG to be able to correct strings whose distance from the code is within some 
constant fraction of the minimum distance of the code). This is potentially useful, since we only 
blow up the query complexity by a factor of poly(d), and perhaps LGGs/LTGs with high rate 
and sub-constant relative distance can have improved query complexity over their constant relative 
distance counterparts. 

Finally, we show that existing families of high rate LGGs and LTGs can achieve sub-polynomial 
query complexity if we only require them to have sub-constant relative distance. Specifically, 
multiplicity codes [KSY14] in a super-constant number of variables give us the desired LGGs, and 
super-constant-wise tensor products |Vidll| give us the desired LTGs. 

As far as we are aware, there have been no previous uses of this distance-amplification technique 
using an expander of super-constant degree. 

More generally, we wish to draw attention to the technique of |AL96] . We believe that it should 
be viewed as a general scheme for improving the rate-distance tradeoff for codes with certain 
desirable properties. In particular, it may transfer properties that codes with constant rate and 
sub-constant relative distance are known to have, to codes with constant rate and constant relative 
distance, and even to codes approaching the Singleton bound. We believe that this is a good 
“take-home message” from this work. 


4 






















Correctable and testable codes. Using the above method, it is also possible to construct 
improved codes that are simultaneously locally correctable and locally testable. This can be done 
by applying the distance-amplification technique to the lifted Reed-Solomon codes of [GKS13] . The 
codes of |GKS13] are both locally correctable and testable, and achieve rates that are arbitrarily 
close to 1. Using these codes of |GKS13] in the sub-constant relative distance regime, and combining 
with our framework, we get codes of constant rate and constant relative distance (which over large 
alphabets approach the Singleton bound) that are both locally correctable and locally testable with 

^0(1/loglogn) 


Locally decodable codes. An important variant of LGGs are locally decodable codes (LDGs). 
Those codes are defined similarly to LGGs, with the following difference: Recall that in the definition 
of LGGs, the decoder gets access to a string z which is close to a codeword c, and is required to 
decode a coordinate of c. In the definition of LDGs, we view the codeword c as the encoding of 
some message x, and the decoder is required to decode a coordinate of x. LDGs were studied 
extensively in the literature, perhaps more so than LGGs (see [Yekl2] for a survey). One notable 
fact about LDGs is that there are constructions of LDGs with a constant query complexity and 
sub-exponential length [YekOSi |Rag07| i IKYOQl IEfrl2] . 

If we restrict ourselves to linear codes, then LDGs are a weaker object than LGG^ since every 
linear LGG can be converted into an LDG by choosing a systematic encoding mapo. Since the 
LGGs we construct in this paper are linear, all our results apply to LDGs as well. 


Organization of this paper. We review the required preliminaries in Section [2l construct our 
LGGs in Section [3l and construct our LTCs in Section IH We conclude with some open questions 
in Section [5l 


Version. A preliminary version of this paper appeared as [Meil4] , where the distance-amplification 
technique was used to construct codes approaching the Singleton bound with query complexity 
O(n^) (for arbitrary /3 > 0). 


2 Preliminaries 

All logarithms in this paper are in base 2. For any n G N we denote [n] {1... ,n}. We denote 

by F 2 the finite field of two elements. For any finite alphabet S and any pair of strings G S"", 
the relative Hamming distance (or, simply, relative distance) between x and y is the fraction of 

def 

coordinates on which x and y differ, and is denoted by dist(x,7/) = |{z G [n] : Xj 7^ yi} \ /n. We 
have the following useful approximation. 

Fact 2.1. For every x, y G M such that 0 < x ■ y < 1, it holds that 

(1 — xY < 1 — - ■ X ■ y. 

Proof. It holds that 

(1 — xY < e~^'y < 1 — ^ • X • y. 

^This conversion will lead to an LDC with the same query complexity, but the running time of the local decoder 
will be small only if the systematic encoding map can be computed efficiently. 
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The second inequality relies on the fact that 1 — ^ ■ x > e~^ for every x € (0,1), which can be 
proved by noting that 1 — | • x = at x = 0, and that the derivative of e~^ is smaller than that 
of 1 — I • X for every x G (0, 1). The first inequality relies on the fact that 1 — x < for every 
X G M, which can be proved using similar considerations. ■ 

2.1 Error correcting codes 

Let S be an alphabet and let n be a positive integer (the block length). A code is simply a subset 
C C S”. If F is a finite field and S is a vector space over F, we say that a code C C S” is F-linear 
if it is an F-linear subspace of the F-vector space S"". The rate of a code is the ratio ; which 

for F-linear codes equals • 

The elements of a code C are called codewords. We say that C has relative distance at least 
5 if for every pair of distinct codewords ci,C 2 G C it holds that dist(ci,C 2 ) > 5. We will use the 
notation dist(r(;, C) to denote the relative distance of a string w from C, and say that w is 

e-close (respectively, e-far) to C if dist(t(;,C') < e (respectively, if dist(tc,C') > e). 

An encoding map for C is a bijection Ec : ^ C, where |S|^ = \C\. We say that an infinite 

family of codes {Cn}^ is explicit if there is a polynomial time algorithm that computes the encoding 
maps of all the codes in the family. For a code C of relative distance 6, a given parameter r < 5/2, 
and a string 2 G S”', the problem of decoding from r fraction of errors is the task of finding the 
unique c G C (if any) which satisfies dist(c, z) < r. 

Reed-Solomon codes. We use the following fact, which states the existence of Reed-Solomon 
codes and their relevant properties. 

Fact 2.2 (Reed-Solomon Codes |RS60j ). For every A:, n G N such that n > k, and for every finite 
field F such that |F| > n, there exists an F-linear code RSk,n ^ with rate r = k/n, and relative 
distance at least 1 — > 1 — r. Furthermore, RSk,n has an encoding map F : F^ —)• RSk,n which 

can he computed in time poly(n, log |F|), and can be decoded from up to (1 — ^^)/2 fraction of 
errors in time poly(n, log |F|). 

2.2 Locally-correctable codes 

Intuitively, a code is said to be locally correctable |BFLS9ll ISTVOll IKT00| if, given a codeword 
c G C that has been corrupted by some errors, it is possible to decode any coordinate of c by 
reading only a small part of the corrupted version of c. Formally, it is defined as follows. 

Definition 2.3. We say that a code C C is locally correctable from r-fraction of errors with query 
complexity q if there exists a randomized algorithm A that satisfies the following requirements: 

• Input: A takes as input a coordinate i G [n] and also gets oracle access to a string z G S” 
that is r-close to a codeword c G C. 

• Output: A outputs c* with probability at least |. 

• Query complexity: A makes at most q queries to the oracle z. 

We say that the algorithm A is a local corrector of C. Given an infinite family of LCCs a 

uniform local corrector for the family is a randomized oracle algorithm that given n, computes the 
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local corrector of Cn- We will often be also interested in the running time of the uniform local 
corrector. 

Remark 2.4. The above success probability of | can be amplified using sequential repetition, 
at the cost of increasing the query complexity. Specifically, amplifying the success probability to 
1 — requires increasing the query complexity by a factor of 0{t). 

2.3 Locally-testable codes 

Intuitively, a code is said to be locally testable |FS95[ IRS961 IGSOOj if, given a string z E S”, it is 
possible to determine whether z is a codeword of C, or rather far from C, by reading only a small 
part of z. There are two variants of LTCs in the literature, “weak” LTCs and “strong” LTCs. Prom 
now on, we will work exclusively with strong LTCs, since it is a simpler notion and allows us to 
state a stronger result. 

Definition 2.5. We say that a code C C is (strongly) locally testable with query complexity q if 
there exists a randomized algorithm A that satisfies the following requirements: 

• Input: A gets oracle access to a string z E S”. 

• Completeness: If z is a codeword of C, then A accepts with probability 1. 

• Soundness: If z is not a codeword of C, then A rejects with probability at least dist(z, C). 

• Query complexity: A makes at most q non-adaptive queries to the oracle z. 

We say that the algorithm d is a local tester of C. Given an infinite family of LTCs {Cn}^, a 
uniform local tester for the family is a randomized oracle algorithm that given n, computes the local 
tester of Cn- Again, we will often also be interested in the running time of the uniform local tester. 

A remark on amplifying the rejection probability. It is common to define strong LTCs 
with an additional parameter p, and have the following soundness requirement: 

• If z is not a codeword of C, then A rejects with probability at least /?-dist( 2 ;, C). 

Our definition corresponds to the special case where p = 1- However, given an LTC with p < 1, it 
is possible to amplify p up to 1 at the cost of increasing the query complexity. Hence, we chose to 
fix p to 1 in our definition, which somewhat simplifies the presentation. 

The amplification of p is performed as follows: The amplified tester invokes the original tester A 
for ^ times, and accepts only if all invocations of A accept. Clearly, this increases the query 
complexity by a factor of ^ and preserves the completeness property. To analyze the rejection 
probability, let z he a string that is not a codeword of C, and observe that the amplified tester 
rejects it with probability at least 

1 — (I — p • dist(z, C))p 

> 1— ^1 — - •- •p - dist(z, (Fact 12.ip 

= dist( 2 :, C), 


as required. 
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2.4 Expander graphs 

Expander graphs are graphs with certain pseudorandom connectivity properties. Below, we state 
the construction and properties that we need. The reader is referred to |HLW06j for a survey. For 
a graph G, a vertex s and a set of vertices T, let E{s,T) denote the set of edges that go from s 
into T. 

Definition 2.6. Let G = {U L}V,E) he a. bipartite d-regular graph with |f7| = 
that G is an (a, 7 )-sampler if the following holds for every T <ZV: For at least 1 
vertices s £ U it holds that 

|E(«,T)| |r| ^ 

— -V - 

Lemma 2.7. For every a, 7 > 0 and every sufficiently large n G N there exists a bipartite d-regular 
graph Gn,a,'y = {U U V,E) with \U\ = |F| = n and d = poly such that Gn,a,'y is an ( 0 , 7 )- 

sampler. Furthermore, there exists an algorithm that takes as inputs n, a, 7 , and a vertex w of 

Gn,a,'y, o.nd computes the list of the neighbors of w in Gn,a,'y in time poly(^^;^). 

Proof sketch. A full proof of Lemma [2.71 requires several dehnitions and lemmas that we have not 
stated, such as second eigenvalue, edge expansion, and the expander mixing lemma. Since this is 
not the focus of this paper, we only sketch the proof without stating those notions. The interested 
reader is referred to |HLW06j . 

def 

Let a, 7 and n be as in the lemma. We sketch the construction of the graph G = Gn,a,'y First, 
observe that it suffices to construct a strongly-explicit non-bipartite graph G' over n vertices (that 
is, a graph G' in which the neighborhood of any given vertex is computable in time poly(logn)) 
with the desired property. The reason is that each such graph G' can be converted into a bipartite 
graph G with the desired property, by taking two copies of the vertex set of G' and connecting the 
two copies according to the edges in G'. The existence of the algorithm stated in the lemma follows 
from the fact that G' is strongly-explicit. 

We thus focus on constructing the graph G'. This is done in two steps: first, we show how 
to construct a strongly-explicit expander G" over n vertices - this requires a bit of work, since n 
can be an arbitrary number, and expanders are usually constructed for special values of n. In the 
second step, we amplify the spectral gap of G" by powering, and set G' to be the powered graph. 
We then prove that G' has the desired sampling property. 

The first step. The work of |GG81| gives a strongly-explicit expander with constant degree and 
constant edge expansion for every n that is a square, so we only need to deal with the case in which 
n is not a square. Suppose that n = wf — k, where is the minimal square larger than n, and 
observe that k < 2m — 1, which is at most ^ for sufficiently large m. Now, we construct an 

expander over vertices using |GG81] . and then merge k pairs of vertices. In order to maintain 

the regularity, we add self-loops to all the vertices that were not merged. We set G" to be the 
resulting graph. 

It is easy to see that G" is a regular graph over n vertices. Since the merge and the addition 
of self-loops maintain the degree and the edge expansion of the original expander up to a con¬ 
stant factor, it follows that G" is an expander with constant degree and constant edge expansion. 
Furthermore, it is not hard to see that G” is strongly-explicit. 


= |F| = n. We say 
— a fraction of the 











The second step. Since G" is an expander, and in particular has constant edge expansion, it 
follows from the Cheeger inequality [Dod84[ IAM85j that its second-largest normalized eigenvalue 
(in absolute value) is some constant smaller than 1. Let us denote this normalized eigenvalue by 
A. We note that the degree and the edge expansion of G", as well as A, are independent of n. 

We now construct the graph G' by raising G” to the power log;,, {y/a ■ 7 ). Observe that G' is a 

graph over n vertices with degree d poly and normalized second eigenvalue yfa ■ 7 . It is 

not hard to see that G' is strongly-explicit. 


The sampling property. We prove that G' has the desired sampling property. Let T be a 
subset of vertices of G'. We show that for at least (1 — a) fraction of the vertices s of G' it holds 


that 


d 



n 


To this end, let 


S = 


|s E [/ 


d 


1^1 ^ \ 
— >1) ■ 

n ) 


Clearly, it holds that 

i^T)! 

d-|,S| n 

On the other hand, the expander mixing lemma [AC 88 ] implies that 


|i?(5,T)| |T| ^ ^ 


By combining the above pair of inequalities, we get 


7 < y/a ■ 7 • 
l^l < a • |T| < a • n, 

as required. ■ 



3 LCCs with sub-polynomial query complexity 

In this section, we prove the following theorem on LCCs, which immediately implies Theorem 11.31 
from the introduction. 

Theorem 3.1 (Main LCC theorem). For every r E (0,1), there exists an explicit infinite family 
of ¥ 2 -linear codes {Gn}^ satisfying: 

1. Gn has block length n, rate at least r, and relative distance at least 1 — r — o(l). 

2. Gn is locally correctable from fraction of errors with query complexity exp(-^log n ■ log log n). 

3. The alphabet ofCn is a vector space ouer F 2 , such that |S„| < exp (exp(v^log n ■ log log n)). 
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Furthermore, the family {Cnjj^ has a uniform local corrector that runs in time exp(-^log n • log log n). 

We note that the existence of binary LCCs (Theorem 11 . 11 ) also follows from Theorem 13.11 In 
order to construct the binary LCCs, we concatenate the codes of Theorem 13.11 with any asymp¬ 
totically good inner binary code that has efficient encoding and decoding algorithms. The local 
corrector of the binary LCCs will emulate the original local corrector, and whenever the latter 
queries a symbol, the binary local corrector will emulate this query by decoding the corresponding 
codeword of the inner code. Since such constructions are standard (see |KSY14| L we do not provide 
the full details. 

The proof of Theorem 13.11 has two steps. In the first step, we give a transformation that 
amplifies the fraction of errors from which an LCC can be corrected - this step follows the distance 
amplification of |AL96] . In the second step, we construct a locally-correctable code Wn with the 
the desired query complexity but that can only be corrected from a sub-constant fraction of errors. 

Finally, we construct the code Cn by applying the distance amplification to Wn- Those two steps 
are formalized in the following pair of lemmas, which are proved in Sections 13.11 and 13 . 21 respectively. 

Lemma 3.2. Suppose that there exists a code W that is locally correctable from Tw fraction of 
errors with query complexity q, such that: 

• W has rate r^. 

• W is ¥ 2 -linear 

Then, for every 0 < r < ^ and 0 < e < 1, there exists a code C that is locally correctable from 
T fraction of errors with query complexity q ■ poly(l/(e • Tw)), such that: 

• l^^l = l^^l- 

• C has relative distance at least 2 • t, 

• Let A denote the alphabet of W. 
poly(l/(e -Tw))- 

• C is ¥ 2 -linear. 

Furthermore, 

• There is a polynomial time algorithm that computes a bijection from every code W to the 
corresponding code C, given r^, Tw, t, e and A. 

• There is an oracle algorithm that when given black box access to the local corrector of any 
code W, and given also r^, t, e. A, computes the local corrector of the corresponding 
code C. The resulting local corrector of C runs in time that is polynomial in the running time 
of the local corrector ofW and in IjTw, 1 /e and\og{nw) where nw is the block length ofW. 

Lemma 3.3. There exists an explicit infinite family of¥ 2 -linear codes {Wn}^ satisfying: 

1. Wn has block length n, rate at least 1 — and relative distance at least Ll 

2. Wn is locally correctable from Ll fraction of errors with query complexity exp(-^log n ■ log log n). 


log log n 
loe^ n 


and rate at least • (1 — 2 • r — e). 

def 

Then, the alphabet of C is T, = A^ for some p = 
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3. The alphabet ofWn is a veetor spaee over ¥ 2 , such that |A„| < exp (exp(-^log n ■ log log n)). 


Furthermore, the family {Wn}^ has a uniform local corrector that runs in time exp(-^log n ■ log log n). 


Proof of Theorem 13.IL We construct the family by applying Lemma 13.21 to the fam¬ 
ily {Wn}n of Lemma EAl with Tw = ^ ^ 


T = 


1 

2 ' 



2 '(^ ^(logn))- 


It is easy to see that Cn has the required rate, relative distance and alphabet size, and that it can 
be locally corrected from the required fraction of errors with the required query complexity. The 
family {Cnj^ is explicit with the required running time due to the first item in the “furthermore” 
part of Lemma 13.21 and has a uniform local corrector due to the second item of that part. ■ 

Remark 3.4. In Lemma 13.21 above, we chose to assume that W is F 2 -linear for simplicity. More 
generally, if W is F-linear for any finite field F, then C is F-linear as well. Furthermore, the lemma 
also works if W is not F-linear for any field F, in which case C is not guaranteed to be F-linear for 
any field F. 


3.1 Proof of Lemma 13.21 

3.1.1 Overview 

Let 0 < r < Our goal is to construct a code C that can be locally corrected from a fraction of 
errors at most r. The idea of the construction is to combine the LCC W with a Reed-Solomon code 
to obtain a code C that enjoys “the best of both worlds”; both the local correctability of W and 
the good error correction capability of Reed-Solomon. We do it in two steps: first, we construct a 
code C which can be corrected from r fraction of random errors. Then, we augment C to obtain 
a code C that can be corrected from r fraction of adversarial errors. 

We first describe the construction of C. To this end, we describe a bijection from W to C. 
Let rc be a codeword of W. To obtain the codeword c' E C that corresponds to re, we partition w 
into blocks of length b (to be determined later), and encode each block with a Reed-Solomon code 
We choose the relative distance of RSb^d to be 2 • r -|- e, so its rate is 1 — 2 • r — e and the 
rate of C is indeed • (I — 2 • r — e), as required. 

We now claim that if one applies to a codeword c' E C" a noise that corrupts each coordinate 
with probability r, then the codeword d can be recovered from its corrupted version with high 
probability. To see it, first observe that with high probability, almost all the blocks of d have at 
most T -|- I fraction of corrupted coordinates. Let us call those blocks “good blocks”, and observe 
that the good blocks can be corrected by decoding them to the nearest codeword of RSb^d (since 
r -|- I is half the relative distance of RSb^d)- Next, observe that if b is sufficiently large, the fraction 
of “good blocks” is at least I — and hence we can correct the remaining Tyy fraction of errors 
using the decoding algorithm of W. It follows that C can be corrected from r fraction of random 
errors, as we wanted. 

Next, we show how to augment C to obtain a code C that is correctable from adversarial errors. 
This requires two additional ideas. The first idea to apply a permutation that is “pseudorandom” 
in some sense to the coordinates of C. The “pseudorandom” permutation is determined by the 
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edges of an expander graph (see Section [23]) . This step is motivated by the hope that, after the 
adversary decided which coordinates to corrupt, applying the permutation to the coordinates will 
make the errors behave pseudorandomly. This will allow the above analysis for the case of random 
errors to go through. 

Of course, on its own, this idea is doomed to fail, since the adversary can take the permutation 
into account when he chooses where to place the errors. Here the second idea comes into play: 
after applying the permutation to the coordinates of C', we will increase the alphabet size of the 
code, packing each block of symbols into a new big symbol. The motivation for this step is that 
increasing the alphabet size restricts the freedom of the adversary in choosing the pattern of errors. 
Indeed, we will show that after the alphabet size is increased, applying the permutation to the 
coordinates of the code makes the errors behave pseudorandomly. This allows us to prove that the 
code can be decoded from r fraction of errors, as we wanted. 

3.1.2 The construction of C 

Choosing the parameters. Let W, r^, r, e, and A be as in Lemma 13.21 Let {Gn}^ be an 
infinite family of {tw, ^ • e)-samplers as in Theorem 12.71 and let d be their degree. 

Recall that we assumed that W is F 2 -linear, so |A| is a power of 2. Let F be an extension field 
of F 2 , whose size is the minimal power of |A| that is at least d. Let RS^^d be a Reed-Solomon code 
over F with relative distance 2 • r + e, rate 1 — 2 • r — e, and block length d. 

Let Uw be the block length of W, and let t be such that |F| = |A|*. The block length of C will 

be n and its alphabet will be S F*^. Here, we assume that is divisible hy b ■ t. If 

is not divisible by 6 • t, we consider two cases: 

• if Tiw > b ■ tje, we increase to the next multiple of 6 • t by padding the codewords of W 
with additional zero coordinates. This decreases the rate of W by at most e, which essentially 
does not affect our results. 

• Otherwise, we set C to be any Reed-Solomon code with blocklength Uw^ relative distance 2-r, 
and rate 1 — 2 • r. Observe that such a Reed-Solomon is locally correctable from r fraction of 
errors with query complexity 


Uw <b-tje = poly(l/(e • Tw)), 
which satisfies our requirements. 

A bijection from W to C. We construct the code C by describing a bijection from W to C. 

Given a codeword w E W, one obtains the corresponding codeword c E C as follows: 

• Partition w into n blocks of length b ■ t. We view each of those blocks as a vector 

in F^, and encode it via the code RSb,d- Let us denote the resulting string by c' E F’^''^ and 
the resulting codewords of RS^^d by Hi, ..., Bn E F'^. 

• Next, we apply a “pseudorandom” permutation to the coordinates of c' as follows: Let Gn 
be the graph from the infinite family above and let U = {ui, ..., Un} and V = {ui, . . ., Vn} 
be the left and right vertices of Gn respectively. For each i E [n] and j E [d], we write the 
j-th symbol of Bi on the j-th. edge of Ui. Then, we construct new blocks Si,... ,Sn E F'^, by 
setting the j-th symbol of Si to be the symbol written on the j-th. edge of u*. 


12 


• Finally, we define the codeword c of C C S"" as follows: the z-th coordinate Cj is the block 
Si, reinterpreted as a symbol of the alphabet S F'^. We choose c to be the codeword in C 
that corresponds to the codeword w mW. 

This concludes the definition of the bijection. It is not hard to see that this bijection can be 
computed in polynomial time, and that the code C is F 2 -linear. Furthermore, S = F'^ = where 
d-t< dlogd = poly(l/(e • tw))- The rate of C is 

log|g| 

n • log IS 


as required. The relative distance of C is at least 2 • r ~ although this could be proved directly, it 
also follows immediately from the fact that C is locally correctable from r fraction of errors, which 
is proved in the next section. 

3.1.3 Local correctability 

In this section, we complete the proof of Lemma 13.21 bv proving that C is locally correctable from 
T fraction of errors with query complexity poly(d) • q. To this end, we describe a local corrector A. 
The algorithm A is based on the following algorithm Aq, which locally corrects coordinates of W 
from a corrupted codeword of C. 

Lemma 3.5. There exists an algorithm Aq that satisfies the following requirements: 

• Input: Aq takes as input a eoordinate i G [n^], and also gets oracle aecess to a string z 
that is T-elose to a eodeword c £ C. 

• Output: Let he the codeword of W from which c was generated. Then, Aq outputs wf 
with probability at least 1 — 

• Query complexity: Aq makes poly((i) • q queries to the oraele z. 

Before proving Lemma 13.51 we show how to construct the algorithm A given the algorithm Aq. 
Suppose that the algorithm A is given oracle access to a string 2 ; that is r-close to a codeword c G C, 
and a coordinate i G [n]. The algorithm is required to decode c*. Let G be the codeword 
of W from which c was generated, and let B^,..., and Sf ,..., 5^ be the corresponding blocks. 

In order to decode Cj, the algorithm A should decode each of the symbols in the block Sf G F'^. 
Let Ujj^,..., Uj^ be the neighbors of vi in the graph Gn. Each symbol of the block Sf belongs to one 
of the blocks ..., B^^, and therefore it suffices to retrieve the latter blocks. Now, each block B^^ 
is the encoding via oi b ■ t symbols of (in the alphabet A). The algorithm A invokes the 

algorithm An to decode each of those b ■ t symbols of w^, for each of the blocks ,... ,B^ . By the 

J1 ' Jd 


log|VE| 
n- d - log |F| 
rw ■ log \hA^\ 
n ■ d ■ log |F| 

log|A| 

' d' 


n 


log |F| 

I-2-r-e 


{b-t) 
(1-2-r-e), 
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union bound, the algorithm Aq decodes all those b-t - d symbols of correctly with probability at 
least 1 — b-t-d- 3 :^^ = Whenever that happens, the algorithm A retrieves the blocks ..., 
correctly, and therefore computes the block Sf correctly. This concludes the construction of the 
algorithm A. Note that the query complexity of A is larger than that of Aq by a factor of at 
most b ■ t ■ d, and hence it is at most poly((i) • q. It remains to prove Lemma 13.51 

Proof of Lemma 13.51 Let A^r be the local corrector of the code W. By amplification, we may 
assume that Ayy errs with probability at most 3 :^^, and this incurs a factor of at most poly(d) to 
its query complexity. 

Suppose that the algorithm Aq is invoked on a string z G and a coordinate i G [riw]- The 
algorithm Aq invokes the algorithm A^ to retrieve the coordinate i, and emulates Ay^, in the natural 
way: Recall that A^y expects to be given access to a corrupted codeword of W, and makes queries 
to it. Whenever Ayr makes a query to a coordinate iw G [nw], the algorithm Aq performs the 
following steps. 

1. Aq finds the block Bi to which the coordinate iw belongs. Formally, / \iw/{b ■ t)l • 

2. Aq hnds the neighbors of the vertex ui in Gn- Let us denote those vertices by ..., Vj^. 

3. Aq queries the coordinates ji,.. ■ jd, thus obtaining the blocks Sj ^,..., Sj^. 

4. ^0 reconstructs the block Bi by reversing the permutation of Gn on Sj ^,..., Sj^. 

5. ^0 attempts to decode Bi by applying an efficient decoding algorithm of Reed-Solomon. 

6 . Suppose that the decoding succeeded and returned a codeword of RS^^d that is (r + |)-close 
to Bi. Then, Aq retrieves the value of the iwAh. coordinate of from the latter codeword, 
and feeds it to Aw as an answer to its query. 

7. Otherwise, Aq feeds 0 as an answer to the query of Aw- 

When the algorithm Aw hnishes running, the algorithm ^0 finishes and returns the output of Aw- 
It is not hard to see that the query complexity of Aq is at most d times the query complexity of Aw , 
and hence it is at most poly(d) • q. It remains to show that Aq succeeds in decoding from r fraction 
of errors with probability at least 1 — 3 :^^. 

Let z G B” be a string that is r-close to a codeword c G C. Let w'^ G be the codeword 
of W from which c was generated, and let Bf ,..., B^ and S ^,..., 5^ be the corresponding blocks. 
We also use the following definitions: 

1. Let Sf ,..., 5^ G F'^ be the blocks that correspond to the symbols of z. 

2. Let Bf ,..., Bf^ be the blocks that are obtained from Sf ,..., 5^ by reversing the permutation. 

3. Define blocks Bf ,..., Bf as follows: if Bf is (r + |)-close to RSb^d, then Bf' is the nearest 
codeword of RSb,d- Otherwise, Bf' is the all-zeroes block. 

4. Let G be the string that is obtained by extracting the coordinates of w from each of 
the codewords Bf ,..., Bf'. 
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It is easy to see that Aq emulates the action of on w^. Therefore, if we prove that is r^iz-close 
to w^, we will be done. In order to do so, it suffices to prove that for at least 1 — Tw fraction of the 
blocks Bf, it holds that is (r + |)-close to Bf. 

To this end, let J be the set of coordinates on which z and c differ. In other words, for every 
j G J it holds that Sj ^ Sj. By assumption, \ J\ < T-n. Now, observe that since Gn is a {tw, 5 • e)- 
sampler, it holds that for at least (1 — fraction of the vertices ui of Gn, there are at most 
(r + 1 ) • d edges between ui and J. For each such ui, it holds that B^^ is (r + |)-close to B^^, and 
this concludes the proof. ■ 

It can be verified that the local correctors Aq and A can be implemented efficiently with black 
box access to A-^, as required by the second item in the “furthermore” part of the lemma. 


3.2 Proof of Lemma 13.31 

In this section we prove Lemma 13.31 restated below. 


Lemma 13.31 There exists an explieit infinite family of¥ 2 -linear codes {IFn}„ satisfying: 

1. Wn has block length n, rate at least 1 — and relative distance at least P 

2. Wn is loeally correctable from Q fraction of errors with query complexity exp(-v/log n ■ log log n). 

3. The alphabet ofWn is a veetor spaee An over¥ 2 , such that |A„| < exp (exp(v^log n ■ log log n)). 

Furthermore, the family {IFn}„ has a uniform local corrector that runs in time exp(v^log n • log log n). 

For the proof of Lemma 13.31 we use the multiplicity codes of |KSY14j . in a specialized sub¬ 
constant relative distance regime. 


log log n 
loff^ n 


Lemma 3.6 ( [KSY14] Lemma 3.5]). Let¥ be any finite field. Let s,d,m be positive integers. Let 
M be the multiplicity code of order s evaluations of degree d polynomials in m variables over F. 

def , H. , , 


Then M has bloek length IFI*^ 
is at least 


relative distanee at least <5 = 1- and rate 


/ s-\-m — l\ 

\ m J 


which 


m + s 


s-|F| 


m 




The alphabet of G is ¥^ ^ ), and G is F-/inear. Furthermore, there is poly (F™", ^)) time 

algorithm that eomputes an encoding map of M given s, d, m, and¥. 


Lemma 3.7 ( |KSY14l Lemma 3.6]). Let M be the multiplicity code as above. Let 6=1 — be 

a lower bound for the relative distance of M. Suppose |F| > max{10 • m, 12 • (s -|- 1)}. Then 

M is locally correctable from <5/10 fraction of errors with query complexity 0{s'^ ■ |F|). 


As discussed in Section 4.3 of |KSY14| . this local corrector can be implemented to have running 
time poly(|F| , s™) over fields of constant characteristic. In fact, |Kopl4| shows that the query 
complexity and running time for local correcting multiplicity codes can be further reduced to 
|F| • O ((j)™") queries, but this does not lead to any noticeable improvement for our setting. 

We now prove Lemma 13.31 
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Proof. Let n G N be a codeword length. We set the code Wn to be a multiplicity code with the 
following parameters. We choose F to be a field of size and choose m = 

Note that indeed |F|™^ = n. We choose s = 2-m? - logre. Let 6 = 2 -mAogn (this will be a lower bound 
on the relative distance of the code) and choose the degree of the polynomials to be d = s - |F| •(! —<5). 


It can be verified that the relative distance of the code is at least d > fl 
of the code is at least 


(yW)' 


1 - 


rre 


• (1 - d)™ > 1 - 


1 


2 • log re 


1 - 


1 


2-m ■ log re 


> 1 - 


log re’ 


as required. The alphabet size is 


|F|( m ) < exp ^-\/logre • log log re • s 

-\/logre • log log re 


= exp 


log r 


/ log^ re log log 

\ log log re 


= exp 


^exp ^y^log re • log log re^ ^ . 


Moreover, the alphabet is a vector space over F and hence in particular over F 2 (since we chose the 
size of F to be a power of 2). The code Wn is F-linear and in particular F 2 -linear. 

By Lemma I.S.71 Wn is locally correctable from ^ • (5 > n ("sj fraction of errors with 

query complexity 


0{s^ ■ |F|) < O 




log^ re 


Vlog log 


re 


/ log n 

log log n 


2viognogio^ 


20( Vlog n-log log n) 


as required. Finally, the fact that the family {Wn}n is explicit follows from the “furthermore” part 
of Lemma [3.6l and the fact that it has an efficient uniform local corrector with the required running 
time follows from the discussion after Lemma 13.71 ■ 


3.3 LDCs 

As remarked earlier, by choosing a systematic encoding map, linear LCCs automatically give LDCs 
with the same rate, relative distance, and query complexity. The running time of the local decoding 
algorithm will be essentially the same as the running time of the local correction algorithm, provided 
that the systematic encoding map can be computed efficiently. Using the fact that multiplicity codes 
have an efficiently computable systematic encoding map |Kopl2| , it is easy to check that the codes 
we construct above also have an efficiently computable systematic encoding map. Thus we get 
LDCs with the same parameters as our LCCs. 


4 LTCs with sub-polynomial query complexity 

In this section, we prove the following theorem on LTCs, which immediately implies Theorem 11.41 
from the introduction. 
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Theorem 4.1 (Main LTC theorem). For every r G (0,1), there exists an explicit infinite family of 
¥ 2 -linear codes {Cn}^ satisfying: 

1. Cn has block length n, rate at least r, and relative distance at least 1 — r — o(l). 

2. Cn is locally testable with query complexity exp(-^log n • log log n). 

3. The alphabet ofCn is a vector space over¥ 2 , such that |S„| < exp(exp (-^log n ■ log log n)). 

Furthermore, the family {Cnfn ® uniform local tester that runs in time exp(y^logn • log log n)). 

We note that the existence of binary LTCs (Theorem 11.21) also follows from Theorem 14.11 In 
order to construct the binary LTCs, we concatenate the codes of Theorem 14.11 with any asymp¬ 
totically good inner binary code that has efficient encoding and decoding algorithms. The local 
tester of the binary LTCs will emulate the original local tester, and whenever the latter queries a 
symbol, the binary local tester will emulate this query by reading the corresponding codeword of 
the inner code. If this string is not a legal codeword, the binary tester will reject, and otherwise it 
will decode the symbol and feed it to the original tester. Since such constructions are standard, we 
do not provide the full details. 

The proof of Theorem 14.11 has two steps. In the first step, we give a transformation that 
amplifies the relative distance of an LTC - this step follows the distance amplification of [AL96] . 
In the second step, we construct a locally-testable code Wn with the desired query complexity but 
that has sub-constant relative distance. Finally, we construct the code Cn by applying the distance 
amplification to Wn- Those two steps are formalized in the following pair of lemmas, which are 
proved in Sections 14.21 and 14.81 respectively. 

Lemma 4.2. Suppose that there exists a code W with relative distance 5w that is locally testable 
with query complexity q such that: 

• W has rate r^y. 

• W is ¥ 2 -linear. 

Then, for every 0 < 5, e < 1, there exists a code C with relative distance at least 8 that is locally 
testable with query complexity q ■ poly(l/(e • 8w)), such that: 

• l^^l = |M^I- 

• C has rate at least r^v ■ {1 — 8 — e). 

def 

• Let A denote the alphabet of W. Then, the alphabet of C is H = hP for some p = 
poly(l/(e • 8w))- 

• C is ¥ 2 -linear. 

Furthermore, 

• There is a polynomial time algorithm that computes a bijection from every code W to the 
corresponding code C, given r^, 8w, r, e and A. 
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• There is an oracle algorithm that when given black box access to the local tester of any code W, 
and given also r^, dw, r, e, A, and the block length of W, computes the local tester of the 
corresponding code C. The resulting local tester of C runs in time that is polynomial in the 
running time of the local tester of W and in Ifdw, 1/e and log(nvi^) where nw is the block 
length of W. 


Lemma 4.3. There exists an explicit infinite family of¥ 2 -linear codes satisfying: 

1. Wn has block length n, rate at least 1— and relative distance at least exp(—-^log n • log log n). 

2. Wn is locally testable with query complexity exp(v^log n ■ log log n). 

3. The alphabet of Wn is a vector space A„ over F 2 , such that | A„| < exp (y^logn • log log n). 

Furthermore, the family {Wn}^ has a uniform local tester that runs in time exp(v^log n • log log n). 

Proof of Theorem 14.IL We construct the family by applying Lemma 14.21 to the fam¬ 
ily {VPn}„ of Lemma 03] with 6w = , e = and 


5 = 1--e = l — r — O 

1 - 

log n 



It is easy to see that Cn has the required rate, relative distance and alphabet size, and that it can 
be locally tested with the required query complexity. The family {Cn}n is explicit due to the first 
item in the “furthermore” part of Lemma 14.21 and has a uniform local corrector with the required 
running time due to the second item of that part. ■ 


Remark 4.4. In Lemma 14.21 above, as in Lemma 13.21 we chose to assume that W is F 2 -linear 
for simplicity. More generally, if W is F-linear for any finite field F, then C is F-linear as well. 
Furthermore, the lemma also works if W is not F-linear for any field F, in which case C is not 
guaranteed to be F-linear for any field F. 


4.1 Proof of Lemma 14.21 

Our construction of the LTC C is the same as the construction of the LCCs of Section EH with 
and T replaced by 5vt^/2 and 6/2 respectively. Our LTCs have the required rate, relative distance 
and alphabet size due to the same considerations as befor^. 

It remains to prove that C is locally testable with query complexity q ■ poly(l/(£ • 6^)). To this 
end, we describe a local tester A. In what follows, we use the notation of Section [3.1.21 

Let Aiy be the local tester of W. When given oracle access to a purported codeword 2 G S"", 
the local tester A emulates the action of Aiy in the natural way: Recall that A^y expects to be 
given access to a purported codeword of W, and makes queries to it. Whenever A^y makes a query 
to a coordinate j G [nw]-, the algorithm A performs the following steps: 

1. A finds the block Bi to which the coordinate j belongs. Formally, I \j/{b ■ t)]. 

®In particular, the lower bound on the relative distance of our LTC C follows from the lower bound on the relative 
distance given in Lemma f3.2[ using the fact that our LTC W has a (trivial, inefficient) nw query local corrector from 
&wl‘2 fraction errors. Again, this lower bound on the distance could have been argued directly, without talking about 
locality. 
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2. A finds the neighbors of the vertex ui in Gn- Let us denote those vertices by Ujj,..., Vj^. 

3. A queries the coordinates ji,... jd, thus obtaining the blocks 

4. A reconstructs the block Bi by reversing the permutation of Gn on Sj-^,, Sj^. 

5. If Bi is not a codeword of RSb^d^ the local tester A rejects. 

6 . Otherwise, A retrieves the value of the j-th coordinate of w from Bi, and feeds it to A^r as 
an answer to its query. 

If A^y finishes running, then A accepts if and only if A^, accepts. 

It is easy to see that the query complexity oi A is d ■ q. It is also not hard to see that if 
z is a legal codeword of G, then A accepts with probability 1. It remains to show that if z is 
not a codeword of G then A rejects with probability at least dist( 2 ;,C'). To this end, it suffices 
to prove that A rejects with probability at least • dist( 2 ;,C') - as explained in Section IXHl 

this rejection probability can be amplified to dist( 2 ;, G) while increasing the query complexity by a 
factor of poly((i), which is acceptable. We use the following definitions: 

1. Let ,..., E F'^ be the blocks that correspond to the symbols of z. 

2. Let BI ,..., B^ E be the blocks that are obtained from ,..., 5^ by reversing the permu¬ 
tation. 

3. Let E (A U {?})”'^ be the string that is obtained from the blocks Sf,... ,B^ as follows: 
for each block that is a legal codeword of RSb,d, we extract from Sf the corresponding 
coordinates of in the natural way. For each block that is not a legal codeword of RSb,d, 
we set the corresponding coordinates of to be “?”. 


We would like to lower bound the probability that A rejects z in terms of the probability that Ayy 
rejects w^. However, there is a small technical problem: A^y is defined as acting on strings in 
and not on strings in (A U {?})”'^. To deal with this technicality, we define an algorithm A'^ that, 
when given access to a string y E (A U emulates A^r on y, but rejects whenever a query is 

answered with “?”. We use the following proposition, whose proof we defer to Section 14.1.11 


Proposition 4.5. rejects a string y E (A U {?})”'^ with probability at least 

i • min{dist(y,IT),(5H^} . 


Now, it is not hard to see that when A is invoked on z, it emulates the action of on w^. To 
finish the proof, note that since each coordinate in W has at most d coordinates of G that depend 
on it, it holds that 

dist( 2 ;, G) ■ n < d ■ dist(t(;^, IT) • n^i/ 


and therefore 


dist(u;^,IT) > 


Tl \ 1 

— • - • dist(z, G) > - - - ■ dist(z, G). 

Uw d b ■ t■d 


It thus follows that A rejects 2 with probability at least 


1 

2 


• min{dist(r(;^, IT), (5 vk} > 


1 

poly(d) 


dist(z, G), 
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as required. 

It is not hard to see that the local tester A can be implemented efficiently with black box access 
to as required by the second item in the “furthermore” part of the lemma. 


4.1.1 Proof of Proposition 14.51 

We use the following result. 


Claim 4.6. Let I C [uw] be a set of coordinates, 
with probability at least 


min 



Uv 


The algorithm A^y 



queries some coordinate in I 


Note that this claim only makes sense since we assumed that Ayy makes non-adaptive queries (we 
assumed it in Definition 12.5p . Without this assumption, the probability that Ayy queries some 
coordinate in I would have depended on the tested string. 

Proof. It suffices to prove that for every I C [uw] such that ^ < \ ■ 6w, the algorithm Ayy 

queries some coordinate in I with probability at least Let I be such a set, and let s G 
be an arbitrary string that contains non-zero values inside I, and contains 0 everywhere outside I. 
Clearly, 

dist(s, W) = 

Uw 

and therefore Ayy rejects s with probability at least On the other hand. Aw can only reject s 
if it queries some coordinate in I, since otherwise it cannot distinguish between s and the all¬ 
zeroes codeword. It follows that Aw queries some coordinate in I with probability at least as 
required. ■ 


We turn to proving Proposition 14.51 Let 


E {i : y, =?} 


be the set of erasures in y. We consider two cases: 

• Li is “large”: Suppose that ^ ^ • dist(y, W). In this case, it holds by Claimthat Aw 

queries some coordinate in E with probability at least 

1 

- • min{dist(?/, • 


Since rejects y whenever Aw queries some coordinate in E, the proposition follows. 

• Li is “small”: Suppose that ^ ^ ■ dist(?/, W). Let yo G be an arbitrary string that 

agrees with y outside E. Clearly, 

dist(y, W) < dist(yo, W) + —, 

Uw 
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so dist(yo, W) > ^ •dist(y, W). Let £ denote the event that A-^ queries some coordinate in E. 
We have that 

Pr [A'^ rejects y] = 

> 

> 

> 

as required. 

This concludes the proof. 

4.2 Proof of Lemma 14.31 

In this section, we prove Lemma 14.31 restated below. 

Lemma 14.31 There exists an explieit infinite family of¥ 2 -linear codes {Wnj^ satisfying: 

1. Wn has block length n, rate at least 1— and relative distance at least exp(—Vlogn • log log n). 

2 . Wn is locally testable with query complexity exp(-v/log n ■ log log n). 

3. The alphabet of Wn is a vector space An over F 2 , such that | A„| < exp (y^log n ■ log log n). 

Furthermore, the family {VLn}„ has a uniform local tester that runs in time exp(-^log n • log log n). 

For the proof of Lemma 14.31 we use the tensor product codes instantiated in the sub-constant 
relative distance regime. The use of tensor products to construct LTCs was initiated by [BS06j . 
and was studied further in |Val05l IDSW061 IBV09b[ IBVOQal IVidll] . Our construction is based on 
a result of [Vidll] . 

We start with some definitions. Let F be a finite field. For a pair of vectors hi G and /12 G F^^ 
their tensor product hi ® /12 denotes the matrix M G entries ' (^ 2)12 

for every ii G [ii] and i 2 G [.^ 2 ]- For a pair of linear codes Hi C F^i and H 2 F F^^ their tensor 
product code Hi 0 H 2 C jg defined to be the linear subspace spanned by all matrices of the 

form hi (g) /i 2 where hi G Hi and /12 G 4^2- For a linear code H, let H^ = H and H'^ = H^~^ (g) H. 
The following are some useful facts regarding tensor product codes (see e.g. [DSWOGj l. 

Fact 4.7. Let Hi C F^i and H 2 C F^^ be linear codes of rates ri,r 2 and relative distances 61,62 
respectively. Then Hi® H 2 T a linear code of rate ri ■ r 2 and relative distance 61 ■ 62 - In 

particular, if H is a linear code of rate r and relative distance 6 then H"^ C F^"* is a linear 
code of rate r™' and relative distance 6 ^. 

We use the following theorem that is given as Corollary 3.6 in [Vidll] . 

Theorem 4.8 (Immediate corollary of [Vidlll Thm. 3.1]). Let H C.W^ be a linear code with relative 
distance 6 . Then for every m > 3, the code H"^ C F^"" is locally testable with query complexity 

f ■poly{m)/6‘^^. 


Pr [£] • Pr [A(y rejects y\£] + Pr [-.T] • Pr [A'^ rejects y\^£] 
Pr [T] • 1 -|- Pr [-iT] • Pr [A^, rejects yo\^£] 

Pr [£] ■ Pr [Aw rejects yol^j + Pr • Pr [^w rejects 

Pr [Aw rejects yo] 

dist(yo,lT) 

^ • dist(y,lT), 
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For the proof of Lemma 14.31 we instantiate Theorem 
Solomorj^ codes. 


with the tensor product of Reed- 


Proof of Lemma 14.31 Fix a codeword length n G N. The code Wn is defined as follows. Let 
F F^v iogn-iogiogn , and let m y^ iogiogn - R be a Reed-Solomon code over F with block 

length n^'rate r = (1 — j and relative distance 1 — r. Note that indeed the block length 
is at most |F|, which is required for the existence of such codes. Finally, let Wn = R™'- 

From the properties of tensor codes we have that Wn is a linear code over F with block length 

and relative distance 


(1-r)- = 


log n ’ 


log n 


l/m'' 


> 


1 


4 • m • log n 

m 


ww 

( - - — 

\ 4 • m • log n ^ 

2” 0 (m-(log m+log log n)) 


(Fact 12.11 : (1 — x)^ < 1 — i • x • y) 


_ 2“0(Vlog n-log log n) 

as required. The fact that Wn can be encoded in time poly(n) follows from standard properties of 
tensor product codes (see e.g. [SudOU Lecture 6]). 

Finally, by Theorem 14.81 we have that Wn is locally testable with query complexity at most 


/ 1 \ -2m 

. poly(m) • - = 20 (Viogn-iogiogn)^ 

\ 4 • m • log n / 

as required. The fact that the family {VFn}„ has a uniform local tester with the required running 
time follows immediately from the proof of [Vidll| . ■ 


5 Open Questions 


We conclude with some open questions. 

• In this work we found that LCCs and LTCs with sub-constant relative distance can be useful. 
Are there better LCCs and LTCs in the sub-constant relative distance regime? 

• LCCs and LTCs often come together with PCPs. Can we construct constant-rate PCPs with 
sub-polynomial query complexity? 

• Are there applications of our LCCs and LTCs to complexity theory? 
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®We chose Reed-Solomon codes for convenience, but any high-rate codes with reasonable distance will do. 
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